Tree-structured Maximum a Po for a Segment-based Speech R
نویسنده
چکیده
In this paper, the problem of the adaptation of a speech recognition system to a new environment is addressed. Recently, a Structural Maximum a Posteriori adaptation (SMAP) for a frame-based HMM model adaptation has been developed. In this method, acoustic model pdfs are organised in a tree and the means and variances of the pdfs are adapted using the linear transformations estimated under MAP criteria. In this paper, we extend the SMAP adaptation to a segment-based model: the Mixture Stochastic Trajectory Model (MSTM). SMAP approach is completed by the tree construction driven by adaptation data, a Minimum Description Length (MDL) structure definition of this tree and trajectory and state adaptations. On the Resource Management task, the speaker adaptation and noise adaptation experiments show that the proposed SMAP approach gives a significant improvement compared to unadapted system.
منابع مشابه
A Mixed Integer Programming Approach to Optimal Feeder Routing for Tree-Based Distribution System: A Case Study
A genetic algorithm is proposed to optimize a tree-structured power distribution network considering optimal cable sizing. For minimizing the total cost of the network, a mixed-integer programming model is presented determining the optimal sizes of cables with minimized location-allocation cost. For designing the distribution lines in a power network, the primary factors must be considered as m...
متن کاملOPTIMIZATION OF TREE-STRUCTURED GAS DISTRIBUTION NETWORK USING ANT COLONY OPTIMIZATION: A CASE STUDY
An Ant Colony Optimization (ACO) algorithm is proposed for optimal tree-structured natural gas distribution network. Design of pipelines, facilities, and equipment systems are necessary tasks to configure an optimal natural gas network. A mixed integer programming model is formulated to minimize the total cost in the network. The aim is to optimize pipe diameter sizes so that the location-alloc...
متن کاملAnnotating Speech Data for Pronunciation Variation Modelling
This paper describes methods for annotating recorded speech with information hypothesised to be important for the pronunciation of words in discourse context. Annotation is structured into six hierarchically ordered tiers, each tier corresponding to a segmentally defined linguistic unit. Automatic methods are used to segment and annotate the respective annotation tiers. Decision tree models tra...
متن کاملSound Signal Processing Based on Seq2Tree Network
Most state-of-the-art solutions to sound signal processing tasks such as the speech and noise separation task and the music style classification task are based on Recurrent Neural Network (RNN) architecture or Hidden Markov Model (HMM). Both RNN and HMM assume that the input is chain-structured so that each element in the chain is equally dependent on all its previous units. However in real-lif...
متن کاملTheoretical models for determination of weight percent of PHCS-g-PLLA co polymer using experimental data
The amphiphilic graft copolymer using chitosan (CS) as hydrophilic segment and poly (L-lactic acid) (PLLA) as hydrophobic segment, was prepared through a protection-graft-de protection route. Chitosan is a polysaccharide comprising of copolymers of glucosamine and N-acetyl glucosamine. Chitosan is the deacetylated derivative of chitin, which is one of the most abundant natural polysaccharides c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002